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Abstract 



We study the approximation problem (or problem of optimal recovery in the L2- 
norm) for weighted Korobov spaces with smoothness parameter a. The weights 7j 
of the Korobov spaces moderate the behavior of periodic functions with respect to 
successive variables. The non- negative smoothness parameter a measures the decay of 
Fourier coefficients. For a = 0, the Korobov space is the L2 space, whereas for positive 
a, the Korobov space is a space of periodic functions with some smoothness and the 
approximation problem corresponds to a compact operator. The periodic functions 
are defined on [0, l] d and our main interest is when the dimension d varies and may 
be large. We consider algorithms using two different classes of information. The first 
class A al1 consists of arbitrary linear functionals. The second class A std consists of only 
function values and this class is more realistic in practical computations. 

We want to know when the approximation problem is tractable. Tractability means 
that there exists an algorithm whose error is at most e and whose information cost is 
bounded by a polynomial in the dimension d and in e _1 . Strong tractability means that 
the bound does not depend on d and is polynomial in e~ l . In this paper we consider 
the worst case, randomized and quantum settings. In each setting, the concepts of 
error and cost are defined differently, and therefore tractability and strong tractability 
depend on the setting and on the class of information. 

In the worst case setting, we apply known results to prove that strong tractability 
and tractability in the class A al1 are equivalent. This holds iff a > and the sum- 
exponent s 7 of weights is finite, where s 7 = inf { s > : Yl'jLi lj < 00 } • 

In the worst case setting for the class A std we must assume that a > 1 to guarantee 
that functionals from A std are continuous. The notions of strong tractability and 
tractability are not equivalent. In particular, strong tractability holds iff a > 1 and 

Eoo , 
j=1 Jj < 00. 

In the randomized setting, it is known that randomization does not help over the 
worst case setting in the class A al1 . For the class A std , we prove that strong tractability 
and tractability are equivalent and this holds under the same assumption as for the 
class A al1 in the worst case setting, that is, iff a > and s 7 < 00. 

In the quantum setting, we consider only upper bounds for the class A std with 
a > 1. We prove that s 7 < 00 implies strong tractability. 

Hence for s 7 > 1, the randomized and quantum settings both break worst case 
intractability of approximation for the class A std . 

USAF, under agreement number F30602-01-2-0523. The U.S, Government is authorized to reproduce and 
distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. The views 
and conclusions contained herein are those of the authors and should not be interpreted as necessarily 
representing the official policies or endorsements, either expressed or implied, of the Defense Advanced 
Research Projects Agency (DARPA), the Air Force Research Laboratory, or the U.S. Government. 
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We indicate cost bounds on algorithms with error at most e. Let c(d) denote the 
cost of computing L(f) for L 6 A al1 or L 6 A std , and let the cost of one arithmetic 
operation be taken as unity. The information cost bound in the worst case setting 
for the class A al1 is of order c(d) • e~ p with p being roughly equal to 2 max(s 7 , a -1 ). 
Then for the class A std in the randomized setting, we obtain the total cost of order 
c(d) e~ p ~ 2 + de~ 2p ~ 2 , which for small e is roughly 

de- 2p - 2 . 

In the quantum setting, we present a quantum algorithm with error at most e that 
uses about only d + loge -1 qubits and whose total cost is of order 

(c(d) +d) e' 1 -^' 2 . 

The speedup of the quantum setting over the randomized setting is of order 

d (1 

c(d) + d \e 

Hence, we have a polynomial speedup of order e~( 1+ ?V 2 ). We stress that p can be 
arbitrarily large, and in this case the speedup is huge. 




1 Introduction 

We study the approximation problem (or problem of optimal recovery in the L2-norm) for 
periodic functions / : [0, l] d — > C that belong to Korobov spaces. These are the most 
studied spaces of periodic functions. Usually, the unweighted case, in which all variables 
play the same role, is analyzed. As in |12|, |23|, in this paper we analyze a more general 
case of weighted Korobov spaces, in which the successive variables may have diminishing 
importance. We consider the unit ball of weighted Korobov spaces Hj. Hence we assume 
that \\f\\d < 1 where the norm depends on a non-negative smoothness parameter a and a 
sequence 7 = {7-,} of positive weights. For a = we have \\f\\d = ll/llL 2 ([o,i] d )) an d for a > 
the norm is given by 

\ 1/2 

rr a (7,/>)|/>)| 2 ) 

where Z d = { . . . , —1, 0, 1, . . . } d , Fourier coefficients are denoted by f(h), and 

r a (l,h) = JJr a (7 i ,A i ) with r a (jj,hj) = j ^l^a |[ ^ ^ £ ^ 
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The smoothness parameter a measures the decay of the Fourier coefficients. It is known 
that the weighted Korobov space consists of functions that are kj times differentiable 
with respect to the jth variable if kj < a/2. For a > 0, the space Ha is a Hilbert space, and 
for a > 1, it is a Hilbert space with a reproducing kernel. 

The weights jj of Korobov spaces moderate the behavior of periodic functions with 
respect to successive variables. For < 1 and for small jj, we have large r Q (7,/i) with 
non-zero hj and therefore the corresponding Fourier coefficient \f(h)\ must be small. In the 
limiting case when jj approaches zero, all Fourier coefficients f(h) with non-zero hj must be 
zero, that is, the function / does not depend on the jth variable. 

We consider algorithms using different classes of information. We study the two classes 
A al1 and A std of information. The first one A al1 = H* d consists of all continuous linear func- 
tionals, whereas the second one A std , called the standard information, is more realistic in 
practical computations and consists only of function values, i.e., of L x (f) = f(x) V/ G Ha 
with x G [0, l] d . Such functionals are continuous only if a > 1. 

Our main interest is when the dimension d varies and may be large. In particular, we 
want to know when the approximation problem is tractable. Tractability means that there 
exists an algorithm whose error is at most e and whose information cost (i.e., the number of 
information evaluations from A al1 or A std ) is bounded by a polynomial in the dimension d and 
in e~ l . Strong tractability means that the bound does not depend on d and is polynomial 
in e -1 . The exponent of strong tractability is defined roughly as the minimal non- negative 
p for which the bound is of order e~ p . 

We consider the worst case, randomized and quantum settings. Each setting has its own 
definition of error, information and total cost. In the worst case setting we consider only 
deterministic algorithms, whose error, information and total costs are defined by their worst 
performance. In the randomized setting we allow randomized algorithms, and their error 
and costs are defined on the average with respect to randomization for a worst function 
from the unit ball of H^. In the quantum setting we allow quantum algorithms that run on 
a (hypothetical) quantum computer, with the corresponding definitions of error and costs. 
Clearly, the concepts of tractability and strong tractability depend on the setting and on 
the class of information. We are interested in checking how the setting and the class of 
information change conditions on tractability. 

The approximation problem corresponds to the embedding operator between the weighted 
Korobov space Hd and the space L 2 ([0, l] d ). This operator is compact iff a > 0. That is 
why for a = we obtain negative results in all three settings and for the two classes of 
information. 

In Section ^| we study the worst case setting. It is enough to consider linear algorithms 
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of the form 

n 

A n ,d(f) = ^2a k L k (f). 



k=l 



Here, the a^'s are some elements of L2QO, and the L^'s are some continuous linear 
functionals from A al1 or A std . The functions a k do not depend on /; they form the fixed output 
basis of the algorithm. Necessary and sufficient conditions on tractability of approximation 
in the worst case setting easily follow from [|1^, |27], |2"8fl . With 



inf < s > : ^ 7* < 00 > , 
< j= i ) 



we have: 



1. Let a > 0. Strong tractability and tractability of approximation in the class A al1 are 
equivalent, and this holds iff a > and the sum-exponent s 7 is finite. If so, the 
exponent of strong tractability is 

p*(A a11 ) = 2 max (s 7 , aT 1 ) 

2. Let a > 1. Strong tractability of approximation in the class A std holds iff 

00 

i=i 

If so, then p*(A a11 ) < 2 and the exponent of strong tractability p*(A std ) satisfies 

p*{A std ) e [p*(A a11 ), p*(A a11 ) +2]. 

3. Let a > 1. Tractability of approximation in the class A std holds iff 

a := hmsup — f — : — < 00. 



c(— > 00 



In d 



In particular, we see that for the classical unweighted Korobov space, in which 7^ = 1 for all 
j, the approximation problem is intractable. To break intractability we must take weights 
7j converging to zero with a polynomial rate, that is, 7^ = 0(j~ k ) for some positive k. Then 
s 7 < l/k. 
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In Section ^ we study the randomized setting. We consider randomized algorithms of the 
form 

A n4 (f,uj) = <p u (Li >UJ (f),L 2;U! (/),..., L n;U} (f)) , 

where u is a random element that is distributed according to a probability measure g, 
and Lk, w £ A with tp^ being a mapping /,From C n into ^([0, l] d ). The randomized error 
of an algorithm A^ is defined by taking the square root of the average value of ||/ — 
A n ,d(f, w )IIl 2 ([o i] 2 ) w ith respect to uj according to a probability measure g, and then by 
taking the worst case with respect to / from the unit ball of H^. 

It is known, see [fL5| |, that randomization does not help over the worst case setting for 



the class A . That is why, for the class A al1 , tractability and strong tractability in the 
randomized setting are equivalent to tractability and strong tractability in the worst case 
setting. For the class A std we prove: 

1. Strong tractability and tractability of approximation are equivalent, and this holds iff 
a > and s 7 < oo. In this case, the exponent of strong tractability is in the interval 
[p*(A a11 ), p*(A a11 ) + 2], where p*(A a11 ) = 2 max(s 7 , a" 1 ). 

2. For any p > p*(A a11 ), we present an algorithm A n ( i with n of order e - ( p + 2 ) and ran- 
domized error at most e. Let c(d) be the cost of computing one function value, and 
let the cost of performing one arithmetic operation be taken as unity. Then the total 
cost of the algorithm A n ^ is of order 

j \ p+2 , ^ N 2p+2 



c(d) I - 1 + d I -J Vrf= 1,2,... , G (0,1). 

Hence, the only dependence on d is through c(d) and d. Clearly, if d is fixed and e 
goes to zero then the second term dominates and the total cost of A n)d is of order 

i \ 2p+2 



d 

The essence of these results is that in the randomized setting there is no difference between 
tractability conditions when we use functionals from A al1 or from A std . This is especially 
important when s 7 > 1, since approximation is then intractable in the worst case setting for 
the class A std independently of a, and is strongly tractable in the randomized setting for the 
class A std . Hence for s 7 > 1, randomization breaks intractability of approximation in the 
worst case setting for the class A std . 

In Section [5] we study the quantum setting. We consider quantum algorithms that run on 
a (hypothetical) quantum computer. Our analysis in this section is based on the framework 
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for quantum algorithms introduced in || that is relevant for the approximate solution of 
problems of analysis. 

We only consider upper bounds for the class A std and weighted Korobov spaces with 
a > 1 and s 7 < oo. We present a quantum algorithm with error at most e whose total cost 
is of order 

/ , n l+3p/2 

(c(d)+d)(-\ Vd= 1,2,... , Vee (0,1) 

with p fa p*(A a11 ) being roughly the exponent of strong tractability in the worst case setting. 

The quantum algorithm uses about d + loge" 1 qubits. Hence, for moderate d and even 
for large e~ l , the number of qubits is quite modest. This is especially important, since the 
number of qubits will be a limiting resource for the foreseeable future. 

It is interesting to compare the results in the quantum setting with the results in the ran- 
domized setting for the class A std . The number of quantum queries is of order 5- 1 ~ 3 P*( Aa )/ 2 
which is smaller than the corresponding number e~ 2 ~ p of function values in the randomized 
setting only if p*(A a11 ) = 2 max(s 7 , a" 1 ) < 2. This holds when s 7 < 1, since a > 1 has 
been already assumed. However, the number of quantum combinatory operations is always 
significantly smaller than the corresponding number of combinatory operations in the ran- 
domized settings. If d is fixed and e goes to zero then the total cost bound in the randomized 
setting is of order de~ 2p ~ 2 which is significantly larger than the total cost bound of order 
(c(d) + d)s~ 1 ~ 3p / 2 in the quantum setting. This means that the exponent of e _1 in the cost 
bound in the quantum setting is 1 + p/2 less than the exponent in the randomized setting. 
We do not know whether our upper bounds for the quantum computer can be improved. 

The speedup of the quantum setting over the randomized setting, defined as the ratio of 
the corresponding randomized and quantum costs, is of order 

d fl 
c(d) + d \e 

Hence, we have a polynomial speedup of order e~( 1+ p/ 2 \ If p*(A a11 ) is close to zero, we may 
also take p close to zero and then the speedup is roughly But p*(A a11 ) can be arbitrarily 
large. This holds for large s T In this case p is also large and the speedup is huge. 

We finish our paper with two appendices. The first is about a general framework for 
quantum algorithms and the second contains a proof of the fact that weighted Korobov 
spaces are algebras. This fact is crucial for our upper bounds for quantum algorithms and 
hence for Theorem f|. 
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2 Approximation for Weighted Korobov Spaces 



In this section we define approximation for periodic functions from the weighted Korobov 
space Hd- The space Hd is a Hilbert space of complex-valued L 2 functions defined on [0, l] d 
that are periodic in each variable with period 1. The inner product and norm of H d are 
defined as follows. We take a sequence 7 = {7,} of weights such that 

1 > 7i > 72 > • • • > 0. 

Let a > 0. For h = [hi, h 2 , ■ ■ ■ , h d ] e Z d define 



where s = 1 for a > 0, and s = for a = 0. Note that r Q (7, h) > 1 for all /i G Z d , and the 
smallest r a (7, /i) is achieved for h = and has the value 1. 
The inner product in Hd is given by 




hez d 



where h — (hi, ... , hd), and f(h) is the Fourier coefficient 




exp (— 2ni h ■ x) f(x) dx, 



with h ■ x = hiXi + • • • + hdXd- The inner product in Hd can be also written as 



(f,g) d = f(o)m + r a (i,h)f(h) g (h), 



thus the zeroth Fourier coefficient is unweighted. The norm in Hd is 



11/11 




Note that for a = we have r (7, h) = 1, and 
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Hence, in this case H d = L 2 ([0, l] d ) is the space of square integrable functions. Observe that 
for any a > we have H d C L 2 ([0, l] d ) and 

H/lkdMW ^ Wf\U V/Gff, 

For a > 1, the space H d is a reproducing kernel Hilbert space, see fl], p6|l . That is, 
there exists a function lf d : [0, l] d x [0, l] ' — > C, called the reproducing kernel, such that 
G# d for ally G [0,l] d , and 

f(y) = (f,K d (-,y)) d VfEH d , \/y G [0, l] d . 

The essence of the last formula is that the linear functional L y (f) = f(y) for / G H d is 
continuous and its norm is 

||LJ = K l d /2 (y,y) Vy G [0, 

It is known, see e.g. pUfl , that the reproducing kernel liQ is 

^ exp (2vrz/i ■ (x - y)) 
Kd ^ y) =T. VJ^h) • 



(2) 



This can be rewritten as 

exp (2nih(xj — Vj)) tt / cos (2irh(xj — Vj)) 



Ki(xy) = SJ „ = n'^,E 

J= lh=-oc 'aXlji' 1 ) j=1 \ h=1 



Hence, K d (x, y) depends on x — y and takes only real values. From this we have 

d 

Kd(v,v) = H(l + 2 lj ((a)), 

3=1 

where ( is the Riemann zeta function, ((a) = J2T=i h~ a . Hence, a > 1 guarantees that 
K d (y,y) is well defined and that \\L y \\ is finite. 

We return to the general case for a > 0. For jj = 1, the space H d is the L 2 version of 
the (unweighted) Korobov space of periodic functions. For general weights jj, the space H d 
is called a weighted Korobov space. 

We now explain the role of weights jj. Take / G H d with \\f\\ d < 1. For small values 
of jj we must have small Fourier coefficients f(h) with hj ^ 0. Indeed, ||/||d < 1 implies 
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that r Q (7, h)\f(h)\ 2 < 1, and for hj ^ this implies that \f(h)\ 2 < 7j/|/ij| a < 7j, as claimed. 
Thus, small 7-,'s correspond to smoother functions in the unit ball of H d in the sense that 

~ 1/2 

the Fourier coefficients f(h) with /ij ^ must scale like 7- in order to keep ||/||d < 1. 

The spaces if^ are related to each other when we vary d. Indeed, it is easy to check that 
for di < d 2 we have 

H dl C H d2 and ||/|| dl = ||/|| d2 V/ G 

That is, a function of c?i variables from iJ^ , when treated as a function of c?2 variables with 
no dependence on the last d 2 — d\ variables, also belongs to H d2 with the same norm as in 
H dl . This means that we have an increasing sequence of spaces H ± C H 2 C • • • C H d , and 
an increasing sequence of the unit balls of H d , B ± C B 2 C • • • C B d , and H dl fl B d2 = B dl 
for di < d 2 . 

So far we assumed that all weights 7, are positive. We can also take zero weights as the 
limiting case of positive weights when we adopt the convention that 0/0 = 0. Indeed, if one 
of the weights tends to zero, say r y d — > 0, then r a (7, h) goes to infinity for all h with h d 7^ 0. 
Thus to guarantee that remains finite we must have f(h) = for all h with h d 7^ 0. 
This means that / does not depend on the x d coordinate. Similarly, if all the weights 7, are 
zero for j > k then a function / from H d does not depend on the coordinates Xk, Xk+i, ■ ■ ■ , x d . 

We are ready to define multivariate approximation (simply called approximation) as the 
operator APP d : H d — > L 2 ([0, l] d ) given by 

APP d f = f. 

Hence, APP^ is the embedding from the Korobov space H d to the space L 2 ([0, l] d ). It is 
easy to see that ||APPd|| = 1; moreover APP^ is a compact embedding iff a > 0. Indeed, 
consider the operator W d := APP^ APP d : H d -> H d , where APP^ : L 2 ([0, l] d ) -> H d is the 
adjoint operator to APP d . Then for all f,g G H d we have 

(W d f,g) d = (APP d f,APP d g) L2{[01]d) = (f,g) L2m]d) . 

From this we conclude that 

W d f h = r- 1 ( 7 ,/^)A \/heZ d , 

where fh(x) = exp (2irih ■ x) /r l J 2 {^, h). We have \\fh\\ d = 1 and span(// l : h G Z d ) is dense 
in L 2 ([0, l] d ). This yields that W d has the form 

(W d f)(x) = h) f(h) exp (2nih ■ x) V / G H d , (3) 

hez d 
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where for a G [0, 1] the convergence of the last series is understood in the L 2 sense. 

Thus, Hd has an orthonormal basis consisting of eigenvectors of Wd, and r~ 1 (7, h) is the 
eigenvalue of Wd corresponding to fh for h G Z d . Clearly, 

l|APP d /|| X2([0jl]d) = (W d f,f)d /2 V/ e H d , 

and therefore, since Wd is self adjoint, 

/ \ 1/2 
||APP d || = \\W d \\ 1/2 = maxr^/i) = 1. 

\hez d J 

For a = we have APP d = Wd and both are the identity operator on L 2 ([0, and 
therefore they are not compact. In contrast, for a > 0, the eigenvalues of Wd go to zero as 
\h\ = \hi\ + \h 2 \ + • • • + | hd | goes to infinity, and therefore the operator Wd is compact and 
APPrf is a compact embedding. 



3 Worst Case Setting 

In this section we deal with tractability of approximation in the worst case setting. To recall 
the notion of tractability we proceed as follows. We approximate APP^ by algorithms^ of 
the form 

n 

KM) = ^2a k L k (f). 

k=l 

Here, the a^'s are some elements of L 2 ([0, and the L k s are some continuous linear 
functionals defined on Hd- Observe that the functions a k do not depend on /, they form the 



fixed output basis of the algorithm, see [ 18[ . For all the algorithms in this paper we use the 



optimal basis consisting of the eigenvectors of Wd- We assume that L k £ A, and consider 
two classes of information A. The first class is A = A al1 = which consists of all continuous 
linear functionals. That is, L G A al1 iff there exists g G Hd such that L(f) = (f,g) d f° r a h 
/ G Hd- The class A al1 is well defined for all a > 0. The second class A = A std is called 
standard information and is defined only for a > 1, 

A = A std = {L x : x G [0, If with L x (f) = f(x) WfeH d }. 

Hence, the class A std consists of function evaluations. They are continuous linear functionals 
since Hd is a reproducing kernel Hilbert space whenever a > 1. 



1 It is known that nonlinear algorithms as well as adaptive choice of L k do not help in decreasing the 
worst case error, see e.g., p4|. 
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The worst case error of the algorithm A n ^ is defined as 



e wor (A M ) = sup{\\f-A n M)\\L 2{[ om : feH d , \\f\U < 1} 



APP d -^a k L k 



k=l 



Let comp wor (e, H^, A) be the minimal n for which we can find an algorithm A n4 , i-e., find 
elements a k G L 2 ([0, l] d ) and functional L k G A, with worst case error at most e||APPd||, 
that is, 

comp wor (£,^,A) = minjri : 3 A n4 such that e WOT (A n4 ) < e \\APP d \\ }. 

Observe that in our case ||APP^|| = 1 and this represents the initial error that we can achieve 
by the zero algorithm A n d = without sampling the function. Therefore e||APP^|| = e can 
be interpreted as reducing the initial error by a factor e. Obviously, it is only of interest to 
consider e < 1. 

This minimal number comp wor (e, Hd, A) of functional evaluations is closely related to the 
worst case complexity of the approximation problem, see e.g., 24]. This explains our choice 
of notation. 



We are ready to define tractability, see p9| . We say that approximation is tractable in 
the class A iff there exist nonnegative numbers C, p and q such that 

comp wor (£,# d ,A) < Ce- p d q ~i e G (0, 1), Vrf G N. (4) 

The essence of tractability is that the minimal number of functional evaluations is bounded 
by a polynomial in e -1 and d. 

We say that approximation is strongly tractable in the class A iff q — in (H). Hence, 
strong tractability means that the minimal number of functional evaluations has a bound 
independent of d and polynomially dependent on e _1 . The infimum of p in (|^) is called the 
exponent of strong tractability and denoted by p* = p*{A). That is, for any positive 5 there 
exists a positive C$ such that 

comp wor (e,# d , A) < C s e- {p ' t+s) Ve G (0,1), Vd G N 

and p* is the smallest number with this property. 

Necessary and sufficient conditions on tractability of approximation in the worst case 
setting easily follow from |12], 28[]. In order to present them we need to recall the notion 
of the sum-exponent s 7 of the sequence 7, see |27[| , which is defined as 



00 



s 7 = inf \ s > : } 7? < 00 }, (5) 
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with the convention that the infimum of the empty set is taken as infinity. Hence, for the 
unweighted case, jj = 1, we have s 7 = oo. For jj = 0(j _K ) with k > 0, we have s 7 = l//t. 
On the other hand, if s 7 is finite then for any positive 5 there exists a positive M$ such that 
k lk +& < T,7=ilj" +S < M s- Hence, lk = 0(k~ 1/( - s " /+5 ' ) ). This shows that s 7 is finite iff 7i 
goes to zero polynomially fast in j , and the reciprocal of s 7 roughly measures the rate of 
this convergence. 

We begin with the class A al1 . Complexity and optimal algorithms are well known in this 
case, see e.g., P3) . Let us define 

R(e,d) = {heZ d : r- 1 ( 7 ,/i) > e 2 }. (6) 

as the set of indices h for which the eigenvalues of Wd, see (0), are greater than e 2 . Then 
the complexity comp wor (£, Hd, A al1 ) is equal to the cardinality of the set R(e, d), 

comp wor (£,# d ,A a11 ) = \R(e,d)\, (7) 



and the algorithm 



A n4 (f)(x) = f{h) exp(2mh-x) 

h€R(e,d) 



with n = \R(s, d)\ is optimal and has worst case error at most e. This simply means that 
the truncation of the Fourier series to terms corresponding to the largest eigenvalues of Wd 
is the best approximation of the function /. 

For a = all eigenvalues of Wd have the value 1. Thus for e < 1 we have infinitely many 
eigenvalues greater than e 2 even for d — 1. Therefore the cardinality of the set R(e, 1) and 
the complexity are infinite, which means that approximation is not even solvable, much less 
tractable. For a > and d — 1, we obtain 

comp" ^,^,^ 11 ) w 2 1 \ /a e- 2/a . 

It is proven in [ETfl that strong tractability and tractability are equivalent, and this holds 



iff s 7 is finite. Furthermore, the exponent of strong tractability is p*(A a11 ) = 2 max (s 7 , a -1 ). 
We stress that the exponent of strong tractability is determined by the weight sequence 
7 if s 7 > a" 1 . On the other hand, if s 7 < a^ 1 then p*(A a11 ) = 2a _1 , and this exponent 
appears in the complexity even when d — 1. For such weights, i.e., s 7 < a" 1 , multivariate 
approximation in any number of variables d requires roughly the same number of functional 
evaluations as for d — 1. 

We now turn to the class A std and assume that a > 1. Formally, tractability of approx- 
imation in the class A std has not been studied; however, it is easy to analyze this problem 
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based on the existing results. First, observe that approximation is not easier than multivari- 
ate integration (or simply integration) defined as 



INT d (/) = / f(x)dx = f(0) VfeH d . 
J[o,i] d 

Indeed, ||INT d || = 1, and for any algorithm A n>d (f) = Ylk=i a kf( x k) for some a k e L 2 ([0, l] d ) 
and some x k € [0, l] d , we have 

n 

\\APP d f - A n>d (f)\\l 2m]d) = Y,\f(h)-XUfW\ 2 > /(0) - £>/(:■*) 

h& d k=l 

with bk = fr 1 j d a k {x) alx. Hence, it is not easier to approximate APP^ than INT^, and neces- 
sary conditions on tractability of integration are also necessary conditions on tractability for 
approximation. It is known, see fl2]| , that integration is strongly tractable iff Y^jLxlj < 00 > 
and is tractable iff a := limsup^^ X^=i Jj/ In d < oo. Hence, the same conditions are also 
necessary for tractability of approximation. Due to [p8fl , it turns out that these conditions 



are also sufficient for tractability of approximation. More precisely, if ^2JLilj < 00 j then 
approximation is strongly tractable and its exponent p*(A std ) G [p*(A all ),p*(A a11 ) + 2], see 
Corollary 2 (i) of |28[. Clearly, in this case p*(A a11 ) < 2. 

Assume that a e (0, oo). Then there exists a positive M such that 

d 

c?7d/ln d < VV'/ln d < M 

3=1 

for all d. Hence, jj = 0(j _1 In j), and clearly s 7 = 1. Once more, by Corollary 2 (i) of pSf , 
we know that for any positive 5 there exists a positive number C$ such that the worst case 
complexity of approximation is bounded by Cs£^^ 2+ ^d^^ a+s . This proves tractability of 
approximation. We summarize this analysis in the following theorem. 

Theorem 1 Consider approximation APP^ : H d — > L 2 ([0, l] d ) in the worst case setting. 

1. Let a > 0. Strong tractability and tractability of approximation in the class A al1 are 
equivalent, and this holds iff < oo and a > 0. In this case, the exponent of strong 
tractability is 

p*(A a11 ) = 2 max^a" 1 ) . 
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2. Let a > 1 . Strong tractahility of approximation in the class A std holds iff 

oo 

When this holds, thenp*(A a,n ) < 2 and the exponent of strong tractahility 

p*(A std ) e [p*(A all ),p*(A a11 ) + 2]. 

3. Let a > 1. Tractahility of approximation in the class A std holds iff 

Ed 

a := limsup — f — - — < oo. 

d^oo In d 

When this holds, for any positive 5 there exists a positive Cs such that 

comp WOI (s,H d ,A std ) < C s e- {2+5) d^ {a)a+s V (2= 1,2, ... , Ve E (0, 1), 
where ( is the Riemann zeta function. 

4 Randomized Setting 

In this section we deal with tractahility of approximation in the randomized setting for the 
two classes A al1 and A std . The randomized setting is precisely defined in [24|. Here we only 
mention that we consider randomized algorithms 



A n ,d(f,uj) = <p u \Lifi,(f),L 2fi ,(f ),..., L nill) (f) 

where to is a random element that is distributed according to a probability measure g, and 
Lk,u £ A with being a mapping ^From C n into ^([0, The essence of randomized 
algorithms is that the evaluations, as well the way they are combined, may depend on a 
random element. The primary example of a randomized algorithm is the standard Monte 
Carlo for approximating multivariate integration which is of the form 



1 n 

A n4 (f,uj) = - 2j/(wfc) 



n 

k=l 



where u = [ui, lj 2 , . . . , u n ] with independent and uniformly distributed over [0, l] d which 
requires nd random numbers from [0, 1]. In this case, L^^if) = f{^k) are function values at 
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random sample points, and <Pu(yi, JJ2, ■ ■ ■ , y n ) — n ~ 1 Y^l=i Vk does not depend on u and is a 
deterministic mapping. 

The randomized error of the algorithm A n ^ is defined as 

e ra %4 M ) = suv{EV 2 (\\f-A n M,")\\l 2 ([om) : f ^ H d , \\f\\ d < l] . 

Hence, we first take the square root of the average value of the error ||/ — A njd (f, u)\\^ 2 ^ ^ 
with respect to u according to the probability measure g, and then take the worst case with 
respect to / from the unit ball of H d . 

Let comp ran (£, A) be the minimal n for which we can find an algorithm A n ^, i.e., a 
measure g, functionals and a mapping ip^, with randomized error at most e. That is, 

comp ran (e,H d ,A) = min { n : 3 A n4 such that e ran (A n4 ) < e }. 

Then tractability in the randomized setting is defined as in the paragraph containing @, 
with the replacement of comp wor (e, Hd, A) by comp ran (e, Hd, A). 

We are ready to discuss tractability in the randomized setting for the class A al1 . It is 
proven in that randomization does not really help for approximating linear operators 
over Hilbert space for the class A al1 since 

comp wor (2 1 / 2 £,if d ,A a11 ) < comp ran (e,^,A a11 ) < comp wor (e, H d , A al1 ), 

and these estimates hold for all e G (0, 1) and for all dsN. 

This means that tractability in the randomized setting is equivalent to tractability in the 
worst case setting, and we can use the first part of Theorem 1 to characterize tractability 
also in the randomized setting. 

We now turn to the class A std . It is well known that randomization may significantly help 
for some problems. The most known example is the standard Monte Carlo for multivariate 
integration of d variables, which requires at most e~ 2 random function values if the L 2 norm 
of a function is at most one, independently of how large d is. 

We now show that randomization also helps for approximation over Korobov spaces, and 
may even break intractability of approximation in the worst case setting. As we shall see, 
this will be achieved by a randomized algorithm using the standard Monte Carlo for approx- 
imating the Fourier coefficients corresponding to the largest eigenvalues of the operator Wd 
defined by (|3]). To define such an algorithm we proceed as follows. 

We assume that a > 1 so that the class A std is well defined. Without loss of generality we 
also assume that approximation is tractable in the class A al1 , which is equivalent to assuming 
that s-y < 00. 
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We know from Section 2 that R(e/2 1 / 2 , d) is the set of indices h for which the eigenvalues 
of Wd are greater than e 2 /2, see (^). We also know that the cardinality of the set R(e/2 1 ^ 2 , d) 
is exactly equal to comp wor (e/2 1//2 , Hd, A al1 ) and that for any positive 5 there exists a positive 
C$ such that 

\R{e/2 l ' 2 ,d)\ = comp wor (e/2 1 / 2 ,i7 d ,A a11 ) < C s £ -0>*( AalI )+<5) Vd = 1,2, . . . , Ve G (0,1), 



with p*(A a11 ) = 2 max(s 7 , a" 1 ) 
We want to approximate 



fix) = J2f{h)exp(2mh-x) 

h& d 



for / G Hd- The main idea of our algorithm is to approximate the Fourier coefficients f(h) 
for h G R(e/2 1 / 2 ,d) by the standard Monte Carlo, whereas the Fourier coefficients f(ti) for 
h R(e/2 1 ^ 2 ,d) are approximated simply by zero. That is, the algorithm A n d takes the 
form 

A n ,d(f,u)(x) = ^2 l-^Tf{uJk)exp(-2iTih-uj k ) J exp (2vri/i • x) , (9) 

heR(e/2 1 / 2 ,d) \ k=l J 

where, as for the standard Monte Carlo, uj = (u>i, u>2, ■ ■ ■ , oj n ) with independent and uniformly 
distributed over [0, l] d . 

The last formula can be rewritten as 

A n4 {f,u){x) = ^^fM ( Yl exp(-2irih-{x-u k )\\. (10) 

n fc=l \fte J R(e/2 1 /2 i rf) V J J 

From ([TOj) it is clear that the randomized algorithm A n> d uses n random function values. 

We are ready to analyze the randomized error of the algorithm A n> d- First of all observe 
that 

TL 2 

/ \f{x)-A n4 {f^)\ 2 dx = f(h)—J2f(u k )e- 2 ^ + iM)!* 

lU ' 1 /iG-R(e/2 1 /2 i d) fc=l ^7?( £ /2 1 /2 id ) 



We now compute the average value of the last formula with respect to tu. Using the well 
known formula for the Monte Carlo randomized error we obtain 

E INT.d/l y |/W + E 

h£R(e/2 1 / 2 ,d) '' h<£R(s/2 1 / 2 ,d) 
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Since INT d (|/| 2 ) = £ fceZd |/>)| 2 < ||/|| 2 , and 



£ r„( 7 ,/i)|/(/i)|Vr«(7^) 



fc^R(e/2 1 /2 j rf) 



h(£R(e/2 1 / 2 ,d) 



< 



\e 2 r a ( % h)\f(h)\ 2 < ls 2 \\f\\l 



h<£R(e/2 1 / 2 ,d) 



the error of A ni< j satisfies 



e ran (A M ) 2 < 



|fl(e/ 



2 1 /* d)| 




n 



Taking 



n 



2|i?( g /2V 2 ,d)| 

£ 2 




(2+p*(A all )+<5) 



) 



(11) 



we conclude that the error of A n ^ is at most e. This is achieved for n given by (|TID , which 
does not depend on d, and which depends polynomially on e -1 with an exponent that exceeds 
the exponent of strong tractability in the class A al1 , roughly speaking, by at most two. This 
means that approximation is strongly tractable in the class A std under exactly the same 
conditions as in the class A al1 . 

We now discuss the total cost of the algorithm A n ^. This algorithm requires n function 
evaluations f{uJk)- Since u>k is a vector with d components, it seems reasonable to assume 
that the cost of one such function evaluation depends on d and is, say, c(d). Obviously, c(d) 
should not be exponential in d since for large d we could not even compute one function 
value. On the other hand, c(d) should be at least linear in d since our functions may depend 
on all d variables. Let us also assume that we can perform combinatory operations such as 
arithmetic operations over complex numbers, comparisons of real numbers, and evaluations 
of exponential functions. For simplicity assume that the cost of one combinatory operation 
is taken as unity. Hence, for given h and Uk, we can compute the inner product h ■ Uk and 
then exp(— 2irih ■ u>k) in cost of order d. 

The implementation of the algorithm A n ^ can be done as follows. We compute and 
output 




fc=i 



for all h e R{e/ 



2 1//2 , d). This is done in cost of order 



nc(d) + nd\R(e/2 



'\d)\. 
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Knowing the coefficients we can compute the algorithm A n ^ at any vector 16 [0, l] d as 

An,d(f,w)(x) = ^ Vh exp (2nih ■ x) 

h<=R(s/2 1 / 2 ,d) 

with cost of order d \R(e/2 1 ^ 2 , d)\. Using the estimates on \R(e/2 1 / 2 , d)\ and n given by (]TT|), 
we conclude that the total cost of the algorithm A n ^ is of order 

/ 1 \ P+2 / -, \ 2p+2 

(-) e(«0 + ( ; ) d 

with p = p*(A a11 ) + 5. Hence, the only dependence on d is through c(d) and d. We stress the 
difference in the exponents of the number of function values and the number of combinatory 
operations used by the algorithm A n ^. For a fixed e and varying d, the first term of the cost 
will dominate the second term when c(d) grows more than linearly in d. In this case the 
first exponent p + 2 determines the total cost of the algorithm A n ^. On the other hand, for 
a fixed d and e tending to zero, the opposite is true, and the second term dominates the first 
term of the cost, and the second exponent 2p + 2 determines the cost of An,d- We summarize 
this analysis in the following theorem. 

Theorem 2 Consider approximation APP^ : Hd — > L2QO, zn the randomized setting. 

1. Let a > 0. Strong tractability and tractability of approximation in the class A al1 are 
equivalent, and this holds iff Sj < 00 and a > 0. When this holds, the exponent of 
strong tractability is 

j9*(A a11 ) = 2 max(s 7 ,a- 1 ) . 

2. Let a > 1 . Strong tractability and tractability of approximation in the class A std are 
equivalent, and this holds under the same conditions as in the class A all ; that is, iff < 
00. When this holds, the exponent of strong tractability p* (A std ) G [p*(A all ),p*(A all )+2]. 

3. The algorithm A n ^ defined by ^) with n given by ( jiiy of order roughly £~(p*( Aall )+ 2 ) 
approximates APP^ with randomized error at most e. For any positive 5 there exists a 
positive number K$ such that the total cost of the algorithm A n ^ d is bounded by 

K s(^j P c(d) + Q P dj Vrf=l,2,..., Vee(0,l), 

with p = p*(A a11 ) + 5. 
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We now comment on the assumption a > 1 that is present for the class A std . As we 
know from Section 3, this assumption is necessary to guarantee that function values are 
continuous linear functionals and it was essential when we dealt with the worst case setting. 
In the randomized setting, the situation is different since we are using random function 
values, and the randomized error depends only on function values in the average sense. This 
means that f(x) does not have to be well defined everywhere, and continuity of the linear 
functional L x (f) = f(x) is irrelevant. Since for any a > 0, the Korobov space H a is a subset 
of 1/2 ([0, l] d ), we can treat / as a Li function. This means that in the randomized setting 
we can consider the class A std for all a > 0. 

Remark 1 This is true only if we allow the use of random numbers ^From [0, 1]. If we only 
allow the use of random bits (coin tossing as a source of randomness) then again we need 
function values to be continuous linear functionals, which is guaranteed by the condition 



a > 1, see for a formal definition of such "restricted" Monte Carlo algorithms. We add 
that it is easy to obtain random bits from a quantum computer while it is not possible to 
obtain random numbers from [0, 1]. 

Observe that the algorithm A n .d is well defined for any a > since the standard Monte 
Carlo algorithm is well defined for functions from L2QO, Furthermore, the randomized 
error analysis did not use the fact that a > 1, and is valid for all a > 0. For a = 
the analysis breaks down since n given by (|TT|) would then be infinite. Even if we treat 
functions in the L 2 sense tractability requires that s 7 be finite. Indeed, for s 1 = 00 we 
must approximate exponentially^ many Fourier coefficients which, obviously, contradicts 
tractability. We summarize this comment in the following corollary. 

Corollary 1 Consider approximation APP^ : Hd — > L2([0,l] d ) in the randomized setting 
with a G [0, 1] in the class A std . 

1. Strong tractability and tractability of approximation are equivalent, and this holds iff 
a > and s 7 < 00. When this holds, the exponent of strong tractability is in the 
interval [p,p + 2], where p = p*(A a11 ) = 2max(s 7 , a -1 ). 

2. The algorithm A n ^ defined by (fj^) with n given by ( \T1\ ) of order roughly e~(p*( Aall )+ 2 ) 
approximates APP^ with randomized error at most e. 

The essence of these results is that in the randomized setting there is no difference between 
tractability conditions when we use functionals from A al1 and when we use random function 

2 We follow a convention of complexity theory that if the function grows faster than polynomial then we 
say it is exponential. 
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values. This is especially important when s 7 > 1, since approximation is then intractable 
in the worst case setting for the class A std independently of a. Thus we have the following 
corollary. 

Corollary 2 Let s 1 > 1. For the class A std , randomization breaks intractability of approxi- 
mation in the worst case setting. 



5 Quantum Setting 

Our analysis in this section is based on the framework introduced in || of quantum algorithms 
for the approximate solution of problems of analysis. We refer the reader to the surveys [|J, 
ZTJ , and to the monographs M, |nj, and |21J for general reading on quantum computation. 
This approach is an extension of the framework of information-based complexity theory 
33J and, more formally, [IB|) to quantum computation. It also extends the binary 
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black box model of quantum computation (see ||) to situations where mappings on spaces 
of functions have to be computed. Some of the main notions of quantum algorithms can be 
found in Appendix 1. For more details and background discussion we refer to ||. 

5.1 Quantum Summation of a Single Sequence 

We need results about the summation of finite sequences on a quantum computer. The 
summation problem is defined as follows. For iV 6 N and 1 < p < oo, let denote the 
space of all functions g : {0, 1, . . . , N — 1} — > R, equipped with the norm 



N-l 



i/p 



= [ \9U)\ P J if><oo, tmd \\g\\ L N= o nm_ i \g(j)\. 



Define S N : L" -»• R by 



p 



1 ^ 



and let 



j=0 



B<; := {g E Lp I \\g\\ L » < 1}. 



Observe that Sn(Bp) = [— 1, 1] for allp and N. We wish to compute A(g,e) which approxi- 
mates S]\r(g) with error e and with probability at least |. That is, A(g, e) is a random variable 
which is computed by a quantum algorithm such that the inequality {Swig) — A(g,e)\ < e 
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holds with probability at least |. The performance of a quantum algorithm can be summa- 
rized by the number of quantum queries, quantum operations and qubits. These notions are 
defined in Appendix 1. Here we only mention that the quantum algorithm obtains informa- 
tion on the function values g(j) by using only quantum queries. The number of quantum 
operations is defined as the total number of bit operations performed by the quantum al- 
gorithm. The number of qubits is defined as m if all quantum operations are performed in 
the Hilbert space of dimension 2 m . It is important to seek algorithms that require as small 
a number of qubits as possible. 

We denote by e^(S^,F) the minimal error (in the above sense, of probability > |) that 
can be achieved by a quantum algorithm using only n queries. The query complexity is 
defined for e > by 

comp qq (e, S N , F) = min{ n | e q n (S N , F) < e}. 

The total (quantum) complexity comp qua (£:, Sn, F) is defined as the minimal total cost of 
a quantum algorithm that solves the summation problem to within e. The total cost of a 
quantum algorithm is defined by counting the total number of quantum queries plus quantum 
operations used by the quantum algorithm. Let c be the cost of one evaluation of g(j). It is 
reasonable to assume that the cost of one quantum query is taken as c + m since <?(j)'s are 
computed and m qubits are processed by a quantum query, see Appendix 1 for more details. 

The quantum summation is solved by the Grover search and amplitude estimation algo- 
rithm which can be found in |J and ||. This algorithm enjoys almost minimal error and 
will be repetitively used for approximation as we shall see in Sections 5.2 and 5.3. 

Let us summarize the known results about the order of e£(Sjv, Bp) for p = oo and p = 2. 



The case p = oo is due to M, (upper bounds) and |T3| (lower bounds). The results in the 



case p = 2 are due to ||. Further results for arbitrary 1 < p < oo can be also found in 



and [pTT | . In what follows, by "log" we mean the logarithm to the base 2. 



Theorem 3 There are constants Cj > for j G {1, . . . , 9} such that for all n,N 6 N with 
2 < n < c\N we have 

el{S N ,B^)^n- 1 

and 

c 2 n~ l < e q n (S N ,B^) < c z n~ l log 3/2 n ■ log log n. 
For e < e < | ; we have 

comp qq (e, S N ,B^)~ mm(N, e' 1 ) 

and 

c 4 min(A^,£- 1 ) < comp qq (e, S N , B%) < c 5 min(iV, e' 1 log 3/2 e~ l ■ log log e" 1 ). 
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For N > e , the algorithm for the upper bound uses about log N qubits and the total com- 
plexity is bounded by 

cecs- 1 < comp qua (e, S N , B%) < c 7 c e' 1 ■ log N 

and 

cgce" 1 < comp qua (£:, S N ,B^) < c 9 ce' 1 log 3/2 e' 1 ■ log log e" 1 ■ logiV. 

So far we required that the error is no larger than e with probability at least |. To 
decrease the probability of failure ^From ~ to, say, e _£//8 one can repeat the algorithm t 
times and take the median as the final result. See Lemma 3 of [§| for details. 

We also assumed so far that H^Hz^ < 1. If this bound is changed to, say, ||<7||ia < M 
then it is enough to rescale the problem and replace g(j) by g(j)/M. Then we multiply the 
computed result by M and obtain the results as in the last theorem with e replaced by Me. 



5.2 The Idea of the Algorithm for Approximation 

The starting point of our quantum algorithm for approximation is a deterministic algorithm 
on a classical computer that is similar to the randomized algorithm given by (H), namely 

(l N \ 

A N4 (f)(x) = ^2 I f( x i"> exp (~ 27Tih ' x i) ) ex P ( 2nih ■ x ) » ( 12 ) 

heR(e/3,d) V j=l J 

where the x%, . . . ,xn come from a suitable deterministic rule, and R(-, d) is defined by (|6|). 
The error analysis of A^^ will be based on three types of errors. The first error arises from 
replacing the infinite Fourier series by a finite series over the set R(e/3, d); this error is e/3. 
The second error is made since we replace the Fourier coefficients which are integrals by a 
quadrature formulas We will choose N and the deterministic rule for computing xj in such 
a way that the combination of these two errors yields 

\\A N M) ~ /IU 2 ([o,i]«) < |e VfeH d , 11/11, < 1. (13) 

This will be possible (see (|22|) below) if N is, in general, exponentially large in d. This may 
look like a serious drawback, but the point is that we do not need to exactly compute the 
sums in (]TJ). Instead, the sums 



exp {-2mh ■ Xj) ) (14) 

h€R(e/3,d) 
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will be approximately computed by a quantum algorithm whose cost depends only logarith- 
mically on N. We have to guarantee that this third (quantum) error is bounded by e/3, with 
probability at least |. As we shall see, logiV will be at most linear in d and polynomial in 
\oge~ 1 , which will allow us to have good bounds on the total cost of the quantum algorithm. 

Remark 2 Observe that the \R(e/3,d)\ sums given by fllil) depend only on iV function 
values of /, whereas h takes as many values as the cardinality of the set R(e/3,d). Since 
each function value costs c(d), and since c(d) is usually much larger than the cost of one 
combinatory operation, it seems like a good idea to compute all sums in ([14]) simultaneously. 
We do not know how to do this efficiently on a quantum computer and therefore compute 
these sums sequentially. 



5.3 Quantum Summation Applied to our Sequences 

As outlined in the previous subsection, for the approximation problem we need to compute 
Si\i{gh) for several sequences gi, g 2 , . . . , ga each of length N with R = \R(e/3,d)\. We assume 
that Qh £ Lp for p = 2 or p = oo, and \\gh\\p < M. We now want to compute A(gh,e) on a 
quantum computer such that (with e/3 now replaced by e) 

R 

J2\S N (g h )-A(g h ,e)\ 2 <e 2 (15) 

h=l 



with probability at least 4. In our case the sequences g^ are the terms of fll4|) and we assume 
that we can compute gh(j) = ffaj) exp {—2nih ■ Xj). The cost c of computing one function 
value gh(j) is now equal to c(d) + 2d + 2, since we can compute gh(j) using one evaluation 
of / and 2d + 2 combinatory operations needed to compute the inner product y = h- Xj and 
f{xj) exp(— 2my). The cost of one call of the oracle is roughly 

logN + c(d) + 2d + 2, (16) 

since we need about log iV qubits and the cost of computing is c(d) + 2d + 2. 

This summation problem can be solved by the Grover search or amplitude amplification 
algorithm mentioned in Section 5.1. To guarantee that the bound ( |15|) holds it is enough to 
compute an approximation for each component with error S = eR~ l l 2 . We will assume that 

M5 _ t = MJ^_ ^ ^ (17) 

We can satisfy (O) by computing each SN(gh) independently for each h. 
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We begin with the case p = oo. To compute one sum with error 5 with probability 
at least 1 — 77 we need roughly \ogrj~ 1 repetitions of the algorithm and this requires about 
(M/S)\ogrj~ 1 queries. We put rjR — -t to obtain an algorithm that computes each sum in 
such a way that (|15|) holds. Hence we need roug hly MVE\ og R 

queries for each g h . Together 

we need roughly 



MVR , , , 

R ■ ■ log R queries. (18) 

£ 

The case p = 2 is similar and we need roughly 

„ M^R , 3/2 M\fR , , Mv^R , „ 

it- -log' -log log -log it queries. (19) 

£ £ E 

The total cost is of order 



M\/R 

(log N + c(d) + 2d + 2) log i? for p = oo, (20) 

(log N + c(d) + 2d + 2) i?^? log 3/2 ■ log log ^5 . i g R ioTp = 2 . (21) 

5.4 Results on Tractability 

We only consider upper bounds for the class A std and weighted Korobov spaces for a > 1 
and s 7 < oo. We combine the idea ^From Subsection |0] together with the upper bounds 
from Subsection 5.3 . We need estimates for the numbers N, M, and R. 

We know from Section 2 that R(e/3,d) is the set of indices h for which the eigenvalues 
of Wd are greater than £ 2 /9, see (^|). We also know from (0) that the cardinality of the set 
R(e/3, d) is exactly equal to comp wor (£:/3, H d , A al1 ) and that for any positive rj there exists a 
positive C v such that 

R= \R(s/3,d)\ = comp wor {E/3,H d ,A aU ) < C v £-^^ + ^ Vd = 1, 2, . . . , Ve e (0,1). 
For / G Hd with < 1 we know that 

1/2 



\f(y)\ - 

where ( is the Riemann zeta function, and hence 



(f,K d (-,y))\ < K d (y,y)^ 2 = f[ (l + 2 7j C(a)\ 

.7=1 ^ ' 



\f(y)\<exp (C(«)X>Y 

V 1=1 ' 
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This means that when Ejli Ij < 00 we can apply the results from Section O with p = 00 
and M independent of d and of order one. 

=i7i — which happens when s 7 > 1 and could happen if s 7 — 1, we use the 
quantum results for p = 2 and need estimates not only for N in (|14D but also for M that 
bounds the -norms of the terms in fll4]) . 

We know from Lemma 2 (ii) in that there are lattice rules QN,d{f) — N' 1 J2f=i f( x j) 
with prime N and Xj = {jz/N} for some non-zero integer z G [— N/2, N/2] d and with {■} 
denoting the fractional part, for which 

, r\ d 1 (l + 27,-) 1/2 
I INT d (/) - Q N , d (f) I < ■ 11/11* ( 22 ) 



As in Section |5.2| , we have to guarantee an error 5 = eR~ 1 ^ 2 = 0(s 1+ ( p *( A )+ 2 )/ 2 ) for all 
integrands a; 1— > //t(x) = f(x) exp(—2nih ■ x) with h G R(e/3, d). For these integrands we 
have 

^a(7,j) 



< 



Y,\hh + 3)\ 2 r a (l } h + 3 )\ max 



2 



max 



n*(7,j" 



IIJIId 7^r a ( 7 ,/i + j)' 
We now show that 



r a (l,h + j) 



m=l 

Indeed, since r a is a product, it is enough to check ( |2"BD for all components of r a . For the 
mth component it is easy to check that 

— — — v max^r,7 m z )r a \^ 

m 1 ' b m ) 1 

Ta,\imi i^m 1 Jm) 

^From which (|2"3| ) follows. 

In our case s 7 < 00 which implies that 7 m tends to zero and therefore []™ =1 max(l, 7 m 2°) 
is finite. Furthermore, for h G R(e/3,d) we have r a (j,h) < 9/e 2 . Hence, \\fh\\d = 0(1/ e) 
for all h G R(e/3, d). We replace 7j - by 1 in fl22|) and have 

I INT d (A) - Q^(A) I = O f = 0(£ 1+ ^ Aall )+")/ 2 ) 
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if we take N at least of order 

4+p*(A all )+r; 

or 

logiV x d + loge -1 . 

To bound M we need to consider the -norms of the terms fh(xj) = 9h{j) in fll4P . Since 
the Korobov space Hd is an algebra, see Appendix 2, we know that \fh\ 2 G Hd and 

\\\h?\U < C{d)-\\f h \\l = 0(C(d)s- 2 ), 

where C(d) is given in Appendix 2. Applying the bound (B^) to the function \ fh\ 2 , we obtain 
a bound, in the L^-norm, of the sequence zu = (gh(j))j=i,...,N — (fh( x j))j=i,...,N- This is the 
number M that we need in our estimates. We obtain 

\\z h \\lu < M 2 = INT d (|M 2 ) + 0(3 d / 2 C(rf)e- 2 iV- 1 / 2 ). 

Obviously, 

int,(|m 2 ) = \mo) = ^|/> + j)l 2 < 11/112 <i 

To guarantee that M does not depend on d and is of order 1, we take iV such that 

logiV x d + \ogC(d) + loge:~ 1 x d + logs' 1 , 

since log C(d) is of order d due to Appendix 2. 

Putting these estimates together, we obtain estimates for the quantum algorithm. We 
use about d + loge -1 qubits. The total cost of the algorithm is of order 

/-, \ l+3(p*(A all )+^)/2 
(C(d) + d) -j 

Hence, the only dependence on d is through c(d) and d. We summarize this analysis in the 
following theorem. 

Theorem 4 Consider approximation APP^ : Hd —>■ -^([O, l] d ) in the quantum setting with 
a > 1 in the class A std . Assume that s 7 < oo. Then we have strong tractability. The 
quantum algorithm solves the problem to within e with probability at least | and uses about 
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d + loge 1 qubits. For any positive 5 there exists a positive number Kg such that the total 
cost of the algorithm is bounded by 

/jx l+3(p*(A all )+5)/2 x 

K 6 I (c(d) + d) - ) Vd= 1,2,... , Ve 6(0,1). 



It is interesting to compare the results in the quantum setting with the results in the 
worst case and randomized settings for the class A std . We ignore the small parameter 5 in 
Theorems 1, 2, 4 and 6. Then if s 7 > 1, the quantum setting (as well as the randomized 
setting) breaks intractability of approximation in the worst case setting (again for the class 
A std ). The number of quantum queries and quantum combinatory operations is of order 
£: -i-3p*(A all )/2^ j s s ma ll er than the corresponding number of function values in the 

randomized setting only if p*(A a11 ) < 2. However, the number of quantum combinatory 
operations is always significantly smaller than the corresponding number of combinatory 
operations in the randomized settings. 



6 Appendix 1: Quantum Algorithms 

We present a framework for quantum algorithms, see || for more details. Let D, K be 
nonempty sets, and let T{D, K) denote the set of all functions from D to K. Let IK, the 
scalar field, be either the field of real numbers M or the field of complex numbers C, and let 
G be a normed space with scalar field K. Let S : F — > G be a mapping, where F C T{D, K). 
We approximate S(f) for / 6 F by means of quantum computations. Let Hi be the two- 
dimensional complex Hilbert space C 2 , with its unit vector basis {eo, e^}, and let 

H m = Hx ® • ■ • ® R x 

be the m-fold tensor product of Hi, endowed with the tensor Hilbert space structure. It is 
convenient to let 

Z[0,N) := {0,...,N- 1} 

for N G N (as usual, N = {1, 2, ... } and N = N U {0}). Let C m = {\i) : i 6 Z[0, 2 m )} be 
the canonical basis of H m , where \i) stands for ej (g> • • • ® Zjm-n and % = YHk=o jk?™' 1 '** is 
the binary expansion of i. Denote the set of unitary operators on H m by U{H m ). 
A quantum query on F is given by a tuple 

Q = (m, m',m",Z, r, /3), (24) 
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where m, to', m" G N, wl + m" < to, Z C Z[0, 2 m ') is a nonempty subset, and 

t : Z -> £> 
(3 : K ^ Z[0,2 m ") 

are arbitrary mappings. Denote m(Q) := to, the number of qubits of Q. 

Given such a query Q, we define for each / G F the unitary operator Qf by setting for 

|i) \x) \y) eC m = C m > <g> C m » ® C m _ m /_ m »: 

V/ IV 1^ I?/; - | ^ |^ | y ) otherwise, [Zb) 

where © means addition modulo 2 m ". Hence the query uses m! bits to represent the index 
i which is used to define the argument r(i) at which the function is evaluated. We assume 
that the cost of one evaluation of / is c. The value of /(t(z)) is then coded by the mapping /3 
using m" bits. Usually, the mapping (3 is chosen in a such a way that the m" most significant 
bits of /3(f(r(i))) are stored. The number of bits that are processed is m' + m" < m, and 
usually m! + m" is insignificantly less than m. That is why we define the cost of one query 
as m + c. 

A quantum algorithm on F with no measurement is a tuple A = (Q, (Uj) T j =0 ), where Q 
is a quantum query on F, n G No and Uj G U(H m ) (j = 0, . . . , n), with m = m(Q). Given 
/ G F, we let Af G U(H m ) be defined as 

A f = U n Q f U n - 1 ...U 1 QfU . (26) 

We denote by n q (A) := n the number of queries and by m(A) = m = m{Q) the number of 
qubits of A. Let (Af(x, y)) x ,yec m be the matrix of the transformation Af in the canonical 
basis C m , A f (x,y) = (x\A f \y). 

A quantum algorithm on F with output in G (or shortly, from F to G) with k measure- 
ments is a tuple 

A = ((A,)tl(h)tlv), 
where k G N, and An (£ = 0, . . . , k — 1) are quantum algorithms on F with no measurements, 

6 GZ[0,2 mo ), 

for 1 < i < k — 1, be is a function 



6* : JJZ[0,2 mi ) -> Z[0,2 m< ), 



i=0 
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where we denoted mi := m(Ag), and ip is a function 

fc-i 

if : JJZ[0,2 m <) -> G 

with values in G. The output of A at input / e F will be a probability measure A(f) on G, 
defined as follows: First put 

PA,f( x o>---> x k-i) = \ A o,f( x o,b )\ 2 \A 1J (x 1: b 1 (x ))\ 2 . . . 

. . . |A fe _ij(xjk_i,fejfc_i(xo, . . . ,x fc _ 2 ))| 2 - (27) 

Then define by setting 

Mf)(C)= /M./l-'n r, ; ) VGCG. (28) 

^(lov^fc-i)ec 

We let n 9 (A) := X)*=o n q(Ae) denote the number of queries used by A. For brevity we say 
A is a quantum algorithm if A is a quantum algorithm with k measurements for k > 0. 

Informally, such an algorithm A starts with a fixed basis state bo and function /, and 
applies in an alternating way unitary transformations Uj (not depending on /) and the 
operator Qf of a certain query. After a fixed number of steps the resulting state is measured, 
which gives a (random) basis state, say £ - This state is memorized and then transformed 
(e.g., by a classical computation, which is symbolized by b\) into a new basis state &i(£o)- 
This is the starting state to which the next sequence of quantum operations is applied 
(with possibly another query and number of qubits). The resulting state is again measured, 
which gives the (random) basis state £i. This state is memorized, &2(£o?£i) is computed 
(classically), and so on. After k such cycles, we obtain £ , . . . Then finally an element 

<^(£o, • • • , £fc-i) of G is computed (e.g., again on a classical computer) from the results of all 
measurements. The probability measure A(f) is its distribution. 

The error of A is defined as follows: Let < 5 < 1, / 6 F, and let ( be any random 
variable with distribution A(f). Then put e(S, A, f, 6) = inf {e \ P{||S'(/) - Cll > e} < 9} . 
Associated with this we introduce 

e(S,A,F,9) = supe(S,A,f,9), 
e(S,A,f)=e(S,A,f,\), 

and 

e(S, A, F) = e(S, A, F , \) = sup e(S, A, /). 

feF 
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Of course one could easily replace here \ by another positive number a < \. The nth minimal 
query error is defined for n G No as 

e1(S, F) = inf{e(S', A, F) \ A is any quantum algorithm with n q (A) < n}. 

This is the minimal error which can be reached using at most n queries. The quantum query 
complexity is defined for e > by 

comp qq (e, S, F) = mm{n q (A) \ A is any quantum algorithm with e(S, A, F) < e}. 

The quantities e1(S, F) and comp qq (£, S, F) are inverse to each other in the following sense: 
For all n G No and e > 0, e1(S, F) < e if and only if comp qq (e 1 , S,F) < n for all E\ > e. 
Thus, determining the query complexity is equivalent to determining the nth minimal query 
error. The total (quantum) complexity comp qua (e, S, F) is defined similarly. Here we count 
the number of quantum gates that are used by the algorithm; if function values are needed 
then we put c as the cost of one function evaluation. /,From a practical point of view, the 
number of available qubits in the near future will be severely limited. Hence it is a good 
idea to present algorithms that only use a small number of qubits. 

7 Appendix 2: Korobov Spaces are Algebras 

We show that the Korobov space H d is an algebra for a > 1. More precisely, we prove that 
if /, g G H d then fg G H d and 

\\fg\U < C(d)\\f\\ d \\g\\ d , (29) 

with 



C(d) = 2 dm ^ a/2) f[ fl + 2 7j C(a) 

.7 = 1 ^ 



1/2 



For f(x) = f(j) exp(2irij ■ x) and g{x) = J2 k g{k) exp(27iik ■ x), with j and k varying 
through Z d , we have 

f(x)g(x) = f(j)g( k ) exp(2vri(j + k) ■ x) = ^ ( S fO)d(h - j) I exp(2irih ■ x). 

j k h \ j / 

Hence, we need to estimate 

11/0112 = E Y.^)g{h-j)rT{i,h) 
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Observe that 

^ /2 (7m, h m ) < c {r l J 2 {j m , k m ) + r l J 2 {j m , h m - k m )) \/ k m e Z, 

with c = 2 max (°'( a ~ 2 )/ 2 ). This holds for h m = since c > 1 and r a (^ m , k m ) > 1, and is also 
true for /i m 7^ and k m = 0. For other values of h m and k m , the inequality is equivalent to 
\hm\ a/2 < c{\k m \ a l 2 + \h m -k m \ a ' 2 ) which holds with c = 1 for a/2 < 1, and with c = 2 { - a ~ 2 ^ 2 
for a/2 > 1 by the use of the standard argument. Applying this inequality d times we get 



r l/2 
a 



(7, h) < c d II {r l J 2 { lm , k m ) + r l J 2 ( lm , h m - k m )) WkeZ d . 



m=l 



Let D — {1, 2, . . . , d} and let u C D. By u = D — u we denote the complement of u. Define 
rJd,K) = \\_r a ^ m ,h m ), r a {j,hu) = JJ r a (>y m , h m ). 

Then we can rewrite the last inequality as 

rl /2 (l, h)<c d J2 ^ /2 (7, feu) r l J 2 (l, h M - ha) V k E Z d . 

uCD 

For u C D, we define 

K(x) = ^|/(j)|ri /2 ( 7 ,j u )exp(2«j-x), 

G«(a;) = E \9U) I r « /2 (7, fc) exp(2mj ■ x). 

j 

Observe that F u and Gu are well defined functions in L 2 ([0, l] ') since r a (^,j u ) < r a (^,j) for 
all m and since / and g are from H d . In terms of these functions we see that 

E fU)9(h - j) ri /2 (7, h) < E - j)ki /2 (7, h) 

mC-D j 
aC-D j 



32 



Therefore 



h \uCD j J 



Since the sum with respect to u has 2 d terms, we estimate the square of the sum of these 2 d 
terms by the sum of the squared terms multiplied by 2 d , and obtain 

\\fg\\* < 2V d 5> u , 

uCD 

where 



a « = e(e^')^-^) 



We now estimate a u . Each h and j may be written as h = (h u , hu) and j = (j u ,ju), and 
therefore 

a u = E E ( E E ^«o«» j«) ^uCiu - ju, ^ - k) ) 

\ 3u ju / 

= E E ( E E ^ hu ~ Ju > ^rO'u, h u - ju) ) 

hu hy; \ ju ju / 

^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ F u {h u j U; ju)F u (h u k Uj ky^Gu^ju^ hu ju)Gu{k u ^ ku). 

hu h~u ju jll k u kjj; 



Note that 



where 



Similarly 



where 



Gu(j u , hu - ju)Gu(k u , - ku) < G(j u )G(k u ), 



hu 



\ h„ J 



1/2 



^2 Fu(h u - j u ,3u) F u (h u - k u , ku) < F(ju)F(ku), 



F ^) = lY^FuihuJu) 

\ hu 



2 
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We obtain 



J~ k u k— \ j— / \ ku / 



Observe that 

1/2 



J~ / \ Jtt \ 3u / 

< E ( E ^tim •?«) 2r « ( ^' ^r) ) ( E r ° ) 

J~ \ Ju / \ 3u / 

EE i/o«'i«)i 2r «(7,j«) r a(7,j«) J ( E 7 ^ 1 ^-^) ) 

J" / \ 3u / 



For the last sum we have 



Similarly, 



E^ 1 ^) = n (i+TmE^r) = n( 1+2 ^(«)). 

j'u mS \ j^O / men 

E G (^)) ^ IbUSE*"-^*-) = IbllS n( 1 + 2 ^(«)). 

\ k u / fc u mgu 

Putting all these estimates together we conclude that 

\\fg\\l < 2 d c 2d WfWl \\g\\l n( 1 + 2 ^(«))n( 1 + 2 ^C(«)) 



CD rndu m£u 

d 



= 2 d c M ]T ||/||^ |M|^n (l + 27mC(a) 

uCD m=l ^ 

= 4 d c M n (i + 2 7m C(«)) WfWl \\g\\i 

m=l ^ ' 
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^From which (|2"5|) easily follows. 

For the quantum setting, we need to consider the function w(x) = f(x)f(x) = |/(x)| 2 

for / e Hd- Note that / also belongs to H d and ||/||d = \\f\\d, since f(h) = f(—h) and 
r a (7, h) = r a (7, — h) for all ft, e Z d . Then (p9|) guarantees that w E Hd and 

II l/l 2 L < C(d) 11/112 v/ef, (30) 
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